International Conference on Inter Disciplinary Research in Engineering and Technology [ICIDRET] SO 



A 




D 


n 



International Conference on Inter Disciplinary Research in Engineering and Technology 

[ICIDRET] 



ISBN 


978-81-929742-S-S 


Website 


www.icidret.in 


Received 


14 - February - 20 IS 


Article ID 


1C1DRET007 



Vol 


1 


eMail 


icidret@asdf . res . in 


Accepted 


25 - March - 2015 


eAID 


ICIDRET. 201 5. 007 



Feature Extraction and Classification for ECG Signal 
Processing based on Artificial Neural Network and 
Machine Learning Approach 

Stalin Subbiah\ Rajkumar Patro^ , Subbuthai 
^Research Scholar, Research and Development Center, Bharathiar University, Coimbatore, 
^Associate Professor, Department of Electronics, Guru Nanak Engineering College, Andhrapradesh, 
^Research Scholar, Department of Electronics and Instrumentation, Bharathiar University, Coimbatore. 

Abstract- In present day, several types of developments are carried toward the medical application. There has been increased improvement in the 
processing of ECG signals. Get the accurate detection of ECG signals with the help of detection of V, K and S waveform. However these waveforms 
are suffered from some disturbances like noise. Initially denoising the ECG signal using filters and detect the PQKS waveforms. ECG signal is analyzed 
or classify using Extreme Learning Machine (ELM) and it compared with Support Vector Machine (SVM) and Back Propagation Neural Network (BPN). 
The paper classifies the ECG signal into two classes, Normal and Abnormal. ECG waveform is detected and analyzed using the 48 records of the MIT- 
BIH arrhythmia database. The classifier performance is measured in terms of Sensitivity (Se), Positive Predictivity (PP) and Specificity (SP). 

Keywords: Electrocardiogram, Extreme Learning Machine, Support Vector Machine, Back Prorogation Neural Network, MIT-BIH 
arrhythmia database . 



I INTRODUCTION 



ECG is abbreviated as Electrocardiogram and it is used to represent the electrical activity of the heart. It means that it showing the 
heart muscle contraction and relaxation. Electrocardiography is another term is used to give or provide the heart condition. Analysis of 
the ECG waveform is used to identify the heart normal and abnormalities. Basically ECG waveform has some basic waves they are P, 
Q, R, S, T and U. These waveforms are used to analyses the heart condition. The main important one in ECG is depolarization and 
repolarization. Atrial depolarization is represented by P waveform and ventricular depolarization is represented by QRS complex 
waveform and then repolarization of the ventricle is represented by T waveform [ 1 ] . 

There are several existing methods for ECG waveform analysis and provides the system with accuracy and sensitivity. These methods 
are based on various techniques that are wavelet, RBF Neural network, fuzzy with clustering technique, machine learning of SOM and 
autoregressive modeling [2, 3,4, S and 6]. There are several techniques are analyzed the ECG signal and it is described in [7-1 1]. 

Combination of recurrent neural networks and eigenvectors are used to analyze the ECG signal [12]. Another combination of novel 
method for analyzed the ECG signal using particle swarm optimization and radial basis function neural network [13]. Apart from MIT- 
BIH database, ECG signal is collected from European ST-T database and it is described in [14] for normal and abnormal ECG, ECG 
beat segmentation technique is introduced in [IS]. ECG noise removal by sparse derivatives is explained in [16]. 

The paper is organized as follows. In Section 2, describes about some related work. In Section 3, the proposed ECG algorithm. In 
Section 4, we show how the classification method helps increase the overall detection accuracy. The system is applied to the whole 
MIT-BIH Arrhythmia database and the performance is compared to some other state-of-art methods. 
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II RELATED WORK 

In [18] ECG signal is analyzed using cluster analysis method, this technique consists of three major steps they are extracting the QRS 
waveform, second stage is selecting qualitative features and the third stage is determining heartbeat case. This method analyses and 
classified the normal and abnormal heartbeat. In [17] ECG signal noise reductions are carried through the novel approach of 
combination of discrete wavelet transform and artificial neural network. The work of the wavelet transform is decomposes the ECG 
signal and remove the noise then the second stage of the artificial neural network is implemented the inverse transform and adaptive 
filtering for remove the remaining noise. In [19], author presented a KNN algorithm as a classifier for detection of QRS complex. In 
[20], author has presented an algorithm for detection of R-peak. In [21], authors have used three t)pe of classification that are (i) Back 
Propagation Network (BPN), (ii) Feed Forward Network (FEN) and (iii) Multilayered Perceptron (MLP) 

III METHODOLOGY 



Block diagram of proposed framework is shown in figure 1 . The figure provides that the whole fi-amework is divided into three stages 
that are preprocessing, peak detection, heart beat classification. Input signal are picked from MIT-BIH arrhythmia database s given to 
preprocessing. Preprocessing is used to denoised the ECG signal and this process is given as an input to the next stage. 



ECG Signal from 
MIT-BIH 
Database 



Preprocessing 



Removal of denoising 
the ECG signal 



Detection of Peak 
waveform in ECG signal 



Classification 



V 



Normal 



Abnormal 



Fifrure. 1 Block Diagram of proposed Framework 



Algorithm: QRS Complex 
Input: ECG Signal; Output: QRS Complex 
Initialize Count=0 

Y=Noise Removal of ECG signal; Yl=Differentiation of Y signal 
Y2= Differentiation of Yl signal; Y3= Take Cumulative value of Yl and Y2 signal 
To find out the QRS Complex using threshold value 
Threshold= absolute (maximum (Y3)) 
Threshold 1= Threshold/2; 
for i=l: length (Y3) 
if Y3(i)>Threshold 1 
QRS (i) = 1 

Increment the cunt value and increment the i value 

else 

QRS (i) = 0 

end 
end 

Plot the QRS with the approximate time 



The next stage of the proposed framework is peak detection and QRS complex, before QRS complex ECG signal is decomposed used 
Wavelet transform. It is one of the powerful techniques in biomedical signal processing. QRS complex features are given as input to 
classifier and identified the heartbeat as a normal and abnormal. ECG signal are classified by BPN, SVM and ELM. Compared to BPN 
and SVM, ELM provides the simpler implementation, learning speed is fast and provide better performance 

A. Preprocessing 

Preprocessing is one of the important tasks in signal and image processing. This is the first step for proposed ECG classification. The 
work of the preprocessing is to eliminate the noise in the input ECG signal using various filters approaches. Proposed work handled 
median filter, FIR filter, Gaussian filter and Butterworth filter for noise removal in ECG signal [23]. Preprocess result is used to get 
the better efficiency of the ECG signal. Peak detection efficiency is increased due to preprocessing work. Various filters approaches 
used to remove the noise of baseline wander, power interference, and instrumental error. Elimination of baseline wander is therefore 
needed in the ECG signal analysis to diminish the irregularities in beat morphology. 

B. Peak detection 
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ECG signal is extracted using wavelet decomposition; this is done by Daubechies6 (DB6) multiresolution wavelet. First denotes the 
wavelet decomposition vector and their wavelet bookkeeping vector. Then reconstruct the signal based on wavelet decomposition. 
ECG signal is decomposed up to eight levels. This type of decomposition is used to extract the features and helpful for the selection of 
P, Q, R and S wave. Then all the peaks are identified by minimum and maximum value. Figure 2 illustrates the algorithm for QRS 
complex. Using QRS complex, in this paper find out the heart beat and these beats are given to the classification to deliver the classify 
output as normal and abnormal. 

C. Classification 

Feature extraction output is given as an input to classifier. In this paper three types of classification are carried over to classify as 
normal and abnormal heart beat that are BPN, SVM, and ELM. 

1) Back Propagation Neural Network (BPN) 

Given a finite length input patterns Xj^ (k), X2 (k), ... . , Xj, (k) £ R, (1 < k < k) and the desired patterns 
Xi(k),X2(k) X^(k)GR. 

Step 1 : Select the total number of layers M, the number ni(i = 1,2, ... , M — 1) of the neurons in each hidden layer, and an error 
tolerance parameter £ > 0 . 

Step 2: Randomly select the initial values of the weight vectors W^j^ for 1 = 1,2, ... , rij. 



Step 3: Initialization: 

Step 4: Calculate the neural outputs 



Fori = 1,2, ...,M and] = 1,2, .., 

Step 5 : Calculate the output error 

For j = 1,2, . . , m 

Step 6: Calculate the output deltas 



W^^ ^ W^^(0), E ^ 0,k «- 1 (1) 




(w»)\?- 



(2) 



ej=dj-Xf^^ (3) 



Step 7: Recursively calculate the propagation errors of the hidden neurons 



From the layerM - 1, M - 2, ... , tO layer 1. 

Step 8: Recursively calculate the hidden neural delta values. 



6|'^ = e^a' (sf^) (6) 
<=V^)+^6«xr^ (7) 



Step 9 : Update weight vectors 
Step 10: Calculate the error function 

E = E + ii:,'S,ef (8) 
Step 1 1 : if k=K then go to step 1 2; otherwise, k «— k + 1 and go to step 4. 
Step 1 2 : if E < £ then go to step 1 3 ; otherwise go to step 3 . 
Step 13: learning is completed. Output the weights 

2) Support Vector Machine (SVM) 

Support Vector Machine (SVM) is one of the learning system and it is used mainly in classification. It was developed in the year of 
1998 by Vapnik and it is one of the most techniques to solve the supervised classification problem. In essence, SVM classifiers 
maximize the margin between training data and the decision boundary (optimal separating hyperplane) , which can be formulated as a 
quadratic optimization problem in a feature space. The subset of patterns those are closest to the decision boundary are called support 
vectors. 

Consider a set of training examples(Xj^, yj^), (xi,yi), here input Xj G and class labels y; £ {R'*'} and class labels y; £ 
{— 1, +1}. For a linearly separable classification problem, the construction of a hyperplane is W^X + b = 0 so that the margin 
between the hyperplane and the nearest point is maximized and can be posed as the following quadratic optimization problem : 

min„ i (w^^w) (9) 

Subject to yi((w^Xi) + b > 1), where i = 1, 

Based on below equation, forces a rescaling on (w, b) so that the point nearest to the hyperplane has a distance of (l/||w||) [24]. 
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dating (2), slack variables are 
(10) 
(11) 
(12) 



In many practical situations, a separating hyperplane does not exist. To allow the possibilities of i 
introduced like 

h > 0,i = 1, 

To get 

yi((wTxi) + b) > = 

The optimization problem now becomes as follows: 

min„_5i(wTw) + CXj=i?i 

subject to constraints (10) and (11). The C is a user defined constant. It is called regularizing parameter and determines the balance 
between the maximization of the margin and the minimization of the classification error. 

By introducing Lagrange multipliers OC; and using Karush—Kuhn— Tucker theorem of optimization theory, the solution is given by; 

w = i:l=iyi«iXi (13) 

Only a small fraction of the OCj coefficients are nonzero. The corresponding pairs of Xj entries are known as support vectors and they 
fully define the decision boundary. All other training examples with corresponding zero OCj values are now rendered irrelevant and 
automatically satisfy constraint (4) with = 0. The hyperplane decision function for the vector x can be written as follows 

f(x) = sgn[5:|=iyi (x^Xi) + b] (14) 

By replacing the inner product (x^Xj) with kernel functionK(x, Xj); the input data are mapped to a higher dimensional space [24]. It is 
then in this higher dimensional space that a separating hyperplane is constructed to maximize the margin. 

3) Extreme Learning Machine 

Extreme Learning algorithm use a finite number of input and outputs for training in supervised batch learning system. In this system 
consider N arbitrary distinct samples(Xi, tj) £ X R™, in this X; is an n X 1 input vectors and tj is an m X Itarget vector. If an 
SLFN with N hidden nodes can approximate these N samples with zero error, it then implies that there exist (Bj, and bj such that 



The above equation can be written as 
Where 



fN(Xj)=i:!liPiG(ai,bi,Xj)=tj,j = l, 
Hp = T 



,N 



(IS) 



(16) 



H(ai 



G(aj^,b]^,X]^) 
G(ai,bi,XN) 



.bf^.Xj^, ...,Xn) 
,. G(aN,bN,Xi) 

■ G(a[j,bN,XN) 



(17) 



NXN 







^1 




and T — 






Nxm 





(18) 



Nxm 



H is called the hidden layer output matrix of the network [14]; the i* column of H is the i'*" hidden node's output vector with respect 
to inputs Xj^, X2, ... , Xfj and the jth row of H is the output vector of the hidden layer with respect to input Xj . 

In real applications, the number of hidden nodes, N , wil always be less than the number of training samples, N, and, hence, the 
training error cannot be made exactly zero but can approach a nonzero training error £ . The hidden node parameters a; and b; 
(input weights and biases or centers and impact factors) of SLFNs need not be tuned during training and may simply be assigned with 
random values according to any continuous sampling distribution [9], [10], [11]. Linear system and the output weights P are estimated 
as 

P = H+T (19) 
Where H the Moore-Penrose is generalized inverse [IS] of the hidden layer output matrix H. The ELM algorithm which consists of 
only three steps, can then be summarized as 
ELM Algorithm: Given a training set 

K = {(Xi, ti)|Xi £ R",ti £ R™, i = 1, N}, activation functiong(x), and hidden node number N. 

STEP 1 : Assign random hidden nodes by randomly generating parameters (aj, bj) according to any continuous sampling 
distribution, 1 = 1, . . , N. 

STEP 2: Calculate the hidden layer output matrix H. 
STEP 3: Calculate the output weight P such as P = H^T 

IV EXPERIMENTAL RESULT 



The proposed framework results are carried out in MATLAB. The input for proposed work is collected from the MIT-BIH arrhythmia 
database, and it gives the classification output as Normal and Abnormal. Totally 48 patient records are collected from that database. In 
this S0% are used for training and another S0% is used for training. The classification is done by three methods that are Back 
Propagation Neural Network (BPNN), Support Vector Machine (SVM) and Extreme Learning Machine (ELM). In this paper 
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classification performance are evaluated using some metrics that are Sensitivity, Positive Predictivity and Specificity. The following 
equations are used to calculated these metrics 

TP 

Se(%) = — ^ X 100 (20) 

^ ^ TP+FN ^ ' 

TP 

PP(o/o) = — ii— X 100 (21) 

^ ^ TP+FP ^ ' 

TN 

Sp(o/o) = — ^ X 100 (22) 

^ TN+FP ^ ' 

In the above three equation TP denotes the number of true positive samples, FN indicates the number of false negative samples, TN 

denotes the number of true negative samples and FP indicates the number of false positive samples. These TP, TN, FP and FN are used 

for classification and it is defined as 

FP: Normal class classifies as abnormal; TP: Abnormal class classifies as abnormal; FN: Abnormal classifies as normal. 

TN: Normal class classifies as normal. Then Overall classification accuracy is evaluated using below equation 

_ . Correctly classified samples 

Overall Accuracy (%) = — (23) 

Total Number of samples 



TABLE 1 

OVERALL PERFORMANCE OF ELM, SVM AND BPN 



Methods 


Proposed 
Targets 


Normal 


Abnormal 


Se 
(%) 


PP(%) 


Sp(%) 


Accuracy 




Normal class 


24 


1 










ELM 


Abnormal class 


0 


20 


100 


95 


96 


97 




Total 


24 


21 












Normal class 


22 


3 










SVM 


Abnormal class 


2 


18 


90 


86 


88 


73% 




Total 


24 


21 












Normal class 


20 


5 










BPN 


Abnormal 
class 


2 


18 


90 


78 


72 


64% 




Total 


22 


23 
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Noise reduction using Median filter 





Fifiure 3; (a) Input image (b) Noise Reduction using Median filter 
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Figure 4: Feature Exti'action results of PQRS waveform peak detection 
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Figure 5. QRS Comple 



Figure 3 illustrates the preprocessing output, figure 3(a) shows the original image and 3(b) illustrates the noise reduction using median 
filter. The original image is collected from the MIT-BIH arrhythmia database and each signal is digitized at 360 samples per second per 
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channel with 1 1-bit resolution over a 10 mV range. The preprocessing output of median filter is given as the input to the feature 
extraction. Feature extraction is used to detect the PQRS peak waveform and it is shown in figure 4. Peak detection is done by 
maximum and minimum value of the signal. Figure S illustrates the QRS complex and their heart beats. Table 1 provides the results 
such as overall performance metrics for ELM, SVM and BPN. Classification results are shown and given by a confusion matrix, through 
this confusion matrix accuracy, specificity, sensitivity and Positive Predictivity is produced. Totally 48 ECG signals are picked from the 
MIT BIH arrhythmia database, in this SO signals are used for training and 50 signals are used testing. In testing 25 and 20 signals are 
divided as normal and abnormal. From the confusion matrix of TP, TN, FP and FN, accuracy is calculated and given the output of 97% 
for ELM, 89% for SVM and 84% for BPN it is given in table 1 and also provides the comparative result of individual class sensitivities. 
Positive Predictivity, and specificity of various classifiers such as ELM, SVM and BPN. 

V CONCLUSION 

The proposed paper analysis ECG signals using Extreme learning machine. Classified the ECG signal as normal and abnormal classes 
and it is collected from the MIT-BIH arrhythmia database, 48 records are collected from this database, and split 50 for training and 50 
for testing. PQRS features and QRS complex are extracted in this paper. This extraction is useful for the classification of normal and 
abnormal beats. The method achieves are shown in experimental results that ELM gives the best result that is 97% accuracy, a 
sensitivity of 100%, specificity of 96% and a positive predictively of 95%. Different kinds of noise and artifacts contained in the ECG 
signals of the database are reduced using median filter. 
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